An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement
نویسندگان
چکیده
We propose an iterative phase recovery framework to improve spectral mapping with an application to improving the performance of state-of-the-art speech enhancement systems using magnitude-based spectral mapping with deep neural networks (DNNs). We further propose to use an estimated time-frequency mask to reduce sign uncertainty in the overlap-add waveform reconstruction algorithm. In a series of enhancement experiments using a DNN baseline system, by directly replacing the original phase of noisy speech with the estimated phase obtained with a classical phase recovery algorithm, the proposed iterative technique reduces the log-spectral distortion (LSD) by 0.41 dB from the DNN baseline, and increases the perceptual evaluation speech quality (PESQ) by 0.05 over the DNN baseline, averaging over a wide range of signal and noise conditions. The proposed phase mask mechanism further increases the segmental signal-to-noise ratio (SegSNR) by 0.44 dB at an expense of a slight degradation in LSD and PESQ comparing with the algorithm without using any phase mask.
منابع مشابه
Iterative refinement of amplitude and phase in single-channel speech enhancement
While the state-of-the-art speech enhancement methods are focused on the modification of the noisy spectral amplitude, our recent findings demonstrate positive impact of incorporating the speech phase spectrum in speech enhancement. In this show and tell proposal, we demonstrate the recent progress towards utilizing the phase information in closed-loop iterative manner leading to the joint enha...
متن کاملShow & Tell: Iterative Refinement of Amplitude and Phase in Single-channel Speech Enhancement
While the state-of-the-art speech enhancement methods are focused on the modification of the noisy spectral amplitude, our recent findings demonstrate positive impact of incorporating the speech phase spectrum in speech enhancement. In this show and tell proposal, we demonstrate the recent progress towards utilizing the phase information in closed-loop iterative manner leading to the joint enha...
متن کاملSpeech Enhancement Using Iterative Kalman Filter with Time and Frequency Mask in Different Noisy Environment
The main aim of the Speech Enhancement algorithms is to improve the Quality of speech. The Quality of speech is expressed in two parameters. One is clarity, and another is intelligibility. In this paper, we proposed a method to improve the quality of speech based on computationally efficient AR modeled Iterative Kalman Filter with time and frequency mask. This approach is based on reconstructio...
متن کاملUtilizing Kernel Adaptive Filters for Speech Enhancement within the ALE Framework
Performance of the linear models, widely used within the framework of adaptive line enhancement (ALE), deteriorates dramatically in the presence of non-Gaussian noises. On the other hand, adaptive implementation of nonlinear models, e.g. the Volterra filters, suffers from the severe problems of large number of parameters and slow convergence. Nonetheless, kernel methods are emerging solutions t...
متن کاملResolution enhancement for advanced mask aligner lithography using phase-shifting photomasks.
The application of the phase-shift method allows a significant resolution enhancement for proximity lithography in mask aligners. Typically a resolution of 3 µm (half-pitch) at a proximity distance of 30 µm is achieved utilizing binary photomasks. By using an alternating aperture phase shift photomask (AAPSM), a resolution of 1.5 µm (half-pitch) for non-periodic lines and spaces pattern was dem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016